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Abstract 

We study invasion percolation on Aldous' Poisson-weighted infinite tree, and derive two 
distinct Markovian representations of the resulting process. One of these is the a — ► oo limit 
of a representation discovered by Angel, Goodman, den Hollander and Slade [2]. We also 
introduce an exploration process of a randomly weighted Poisson incipient infinite cluster. 
The dynamics of the new process are much more straightforward to describe than those of 
invasion percolation, but it turns out that the two processes have extremely similar behavior. 
Finally, we introduce two new "stationary" representations of the Poisson incipient infinite 
cluster as random graphs on Z which are, in particular, factors of a homogeneous Poisson 
point process on the upper half-plane M x [0, oo). 

Keywords: Invasion percolation, Prim's algorithm, Poisson-weighted infinite tree, Percola- 
tion, Random trees. 
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1 Introduction 

Invasion percolation (or Prim's algorithm [20]) was first introduced by Jarriik [14] as a procedure 
for constructing the minimum weight spanning tree of a connected, weighted, finite graph. The 
procedure, however, may be applied to many infinite graphs without modification. Given a con- 
nected graph G = (V,E), a starting node vq € V, and an injective weight function w : E —> R, 
the algorithm grows a component from the root inductively, adding at each step the lowest weight 
edge leaving the current component. 

For each i = 0, 1, . . . 

1. If {vo, . . . , Vi} = V then stop. 

2. Otherwise, let e = uv <E E be the smallest weight edge for which u 6 

{v , . . .,Vi}, V <£ {v , . . ,,Vi}. 

3. Let fi+i = v and let e.^+i = uv. 

(Throughout the paper, the graphs and weight functions we consider will be such that step 2, 
above, is well defined, i.e., the infemum of the weights of all edges from {w , . . . , Vi} to the rest of 
the graph is attained.) If \V\ < oo, the resulting graph with vertex set {v , . . . ,t>|w_i} and edge 
set {ei, . . . , e|y|_i} is the unique minimum weight spanning tree of G. However, in general, for an 
infinite graph, this procedure does not necessarily build a spanning subgraph of G. In particular, if 
there is an infinite path leaving vq and containing only edges of weight at most h, for some h € K., 
then no vertex v for which inf e3 „ w(e) > h will ever be explored. 
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Prim's algorithm was rediscovered under the name of invasion percolation in the 1980's [5, 
16]. The strong connection between invasion percolation and critical percolation was immediately 
recognized - a particularly nice example of this connection is contained in the fact that invasion 
percolation on Z d occupies an asymptotically zero proportion of the vertices of 1 d if and only if 
the percolation probability at the critical point p c {'^ d ) is zero (see Newman [19], page 24). 

The well-known heuristic that percolation-style processes on 1 d should behave like percolation 
on a regular tree when d is large led Angel, Goodman, den Hollander and Slade [2] to study invasion 
percolation on regular trees. Angel et. al. prove far too many results for us to summarize here. 
Among other topics, they study volume growth and boundary growth, spectral and Hausdorff 
dimensions for the set of vertices explored by invasion percolation (we hereafter refer to this set - 
and to the subgraph induced by this set, which will cause no confusion - as the invasion percolation 
cluster) . Their results all stem from a Markovian representation of the invasion percolation cluster 
as - informally - a single infinite path, at each point of which is attached an independent random 
tree (these trees are "subcritical Bernoulli percolation clusters" with a parameter which becomes 
increasingly close to critical the further along the backbone they are attached). One of the major 
purposes of our paper is to explore a new approach to this structural representation which applies 
in some generality, so we take a moment to explain the representation itself in more detail. 

For the duration of the introduction, for integers a > 2, let T a denote the infinite rooted ir- 
regular tree (each node except the root has degree a + 1), with each edge e labelled by U e ~ 
Uniform[0, a] independently of all other edges. In general, for a weighted rooted graph G, let G(p) 
be the connected subgraph of G containing the root when all edges of weight greater than p are 
discarded. Let p\ = inf{p : T a (p) is infinite}. Then with probability one, 1 < p\ < a, and T a {pi) 
is infinite and contains precisely one edge e\ of weight p\ (this is not hard, and in particular follows 
from Corollary 22 in Section 2.3). The component of T a (pi) containing the root when e\ is removed 
is finite (or else we never would have explored edge ei). Let i be the component of T a (pi) not 
containing the root when e\ is removed; then T a i, which we view as rooted at its unique vertex 
which is an endpoint of e±, is infinite and contains only edges of weight less than p\ . Supposing we 
have defined p±, . . . ,pi, e\, . . . , e,, and T a ,i, . . . , 7^, let Pi+x — inf{p : %r,i(p) is infinite}. Then 
with probability one, 1 < Pi+\ < Pi, and T a- i{pi + x) is infinite and contains precisely one edge e^+i 
of weight Pi+i, which separates the root of 7^ from infinity. We define T^+i to be the component 
of T a i {p i+ i) not containing the root when ej+i is removed, and root this tree at its unique vertex 
which is an endpoint of ej+i. 

Now let P — {/i}£i be the unique path starting from the root of T a and passing through all 
of {&j}fLi (so P is only a.s. defined). This path is called the backbone of the invasion percolation 
cluster. The components of the invasion percolation cluster when all edges in {ej}j C L 1 are removed 
are called ponds; Angel et. al. also study the sizes of these ponds. There is further interesting 
recent work on invasion percolation: on the sizes of ponds for invasion percolation in 1? [6, 7]; and 
on rescaled invasion percolation on trees [3]. For integers n > 1, let W n — W n (T a ) — sup J>n Uf r 
Angel et. al. term the process {Wn}5£Li the backbone forward maximal process of T a . W n is non- 
increasing and has lirrin^oo W n = 1. Note that W n > W n +x only when f n is one of the edges ej, 
in which case W n = Uf n — pj. Angel et. al. prove that {W„ }$£!-! is a Markov process, and specify 
both its transition probabilities and its large-n rescaled behavior. 

The removal of the vertices and edges of P separates the cluster into components of finite size. 
Suppose T is one such cluster and that its neighbor on the path P has distance n from the root. 
Then Angel et. al. show that T is distributed as T a (W n ) conditioned to stay finite, independently 
of all other components. This fact and the results about the backbone forward maximal process 
mentioned in the preceding paragraph form the heart of their structural results. 

In this paper we introduce a new mechanism for studying invasion percolation on randomly 
weighted trees, which can in particular give a new perspective on the structural results of Angel 
et. al. The methodology works in some generality - in fact, parts of it are most easily formulated 
as statements about invasion percolation on graphs with deterministic weights. To apply such 
results, one then needs to check that the hypotheses hold a.s. in a randomly weighted tree under 
consideration (which in practice is always a trivial matter) . We have chosen to present our results 
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in the setting where they are the most simple and striking, which is that of the Poisson-weightcd 
infinite tree, or PWIT. 

Informally, the PWIT can be described as follows. The root r has a countably infinite number 
of children V\, V2, ■ ■ ■■ The edges rv\, rv 2 , ■ • ■ are assigned weights: for each i > 1 the edge ru, is 
weighted with the position of the i'th point of a homogeneous Poisson process of rate 1 on [0, 00). 
(Equivalently, starting from an infinite sequence of independent Exponential 1) random variables 
Ei, E 2 , ■ ■ ., for each i the edge rvt is given weight Ei + . . . + Ei.) This construction is repeated 
independently and recursively at each child of the root. We may view the nodes of the PWIT 
as labelled by Uilo^ 1 ' so that the root has label and in general, node nin 2 ■ ■ ■ has parent 
nin.2 . . . fifc_i and children {nin 2 ■ ■ ■ fifc^lraeNi however, this labelling will not play a major role in 
the paper. 

The PWIT shows up as a standard large-n limit for combinatorial optimization problems on the 
complete graph K n (sec the excellent survey paper by Aldous and Steele [1] for details of how). Our 
case is no exception; as one consequence of our study, we obtain novel proofs of the main results of 
[17], about the early behaviour of Prim's algorithm on K n with i.i.d. uniform weights. Our main 
results, however, link invasion percolation on the PWIT with the Poisson incipient infinite cluster 
- IIC, for short - constructed for general critical branching processes by Kesten [15], but earlier 
in the Poisson case by Grimmett [10]. The Poisson IIC is, informally, a critical Poisson Galton- 
Watson tree - PGW(l), for short - conditioned to be infinite. There are at least two natural ways 
to formalize this statement, but they both yield the same limiting construction, which we now 
describe. Start with a single, one-way infinite path, then make each node of the path the root of 
an independent copy of PGW(l). The resulting infinite tree is the Poisson IIC, which we denote 
by Tnc- 

For the remainder of the introduction, let 7o be a random weighted tree with the distribution 
of the subgraph of the PWIT explored by invasion percolation, with vertices {vq,vi, . . .} in order 
of exploration, and let {14^}°^ be its forward maximum process. (We have not yet proved that 
7o has a forward maximum process, although the proof is straightforward - in particular, this fact 
follows from Corollary 22 in Section 2.3.) Also, for any tree T and vertex v of T, let denote T 

re-rooted at v. For two rooted random graphs G, H, we write G = H to mean G and H have the 
same distribution in the local weak sense (neighborhoods of finite order of the root have the same 

distribution in both graphs; see [1], Section 2, for more details.). Similarly, we write G n — * G to 
denote local weak convergence of a sequence {G n } of rooted random graphs to a limiting random 
graph G. (This notion of convergence in distribution deals only with the topological structure of 
the graph, so in particular ignores any edge weights of the graphs under consideration.) 

Let Pbea homogeneous Poisson process of rate 1 in the upper half-plane R x [0, 00). Given two 
random variables X and Y, we say a random variable X is a factor of Y if almost surely X = f(Y) 
for some deterministic function /. (Usage of this term has not been fully standardized; ours agrees 
with that of [13].) The first main theorem of our paper is the following. 

Theorem 1. There exist two V-a.s. distinct random trees T — T(V), T' = T'(V) with vertex set 
7L such that 

(a) in T there is a unique infinite rightward path from each vertex V-a.s., 

(b) in T' there is a unique infinite leftward path from each vertex V-a.s., and 

(c) neither T nor T' is a factor of the other. 

Furthermore, setting U — T or U = T' , we have 

(d) for any neZ, U(V + n) = U(V) + n, and 

(e) for any n £ Z, is distributed as Tnc- 
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This theorem seems very similar in spirit to results of Ferrari, Landim and Thorisson [9], on 
tree and forest factors of Poisson processes in K d x R, d > 1 (the final copy of K viewed as a time 
dimension). The graph they define is a tree when d — 1, 2 and a forest when d > 3. Some particular 
similarities of note: Ferrari et. al. explain how to use a preorder traversal (or depth-first search, 
a procedure quite similar to invasion percolation) of the points of the Poisson process in order 
to view their trees as having vertex set Z; their graphs also have only one end (only one infinite 
path leaving any vertex); and their graphs are built by joining each point to its first time-successor 
within R d -distance one, yielding a "coalescing random walk" interpretation of the construction 
that is reminiscent of our random- walk description of the forward maximal process in Section 2.3. 
Ferrari et. al. do not explicitly identify the distribution of the graph they define, but it would be 
very interesting to know if it can be meaningfully interpreted as a higher-dimensional analogue of 
the Poisson IIC. Holroyd and Peres [12] have also studied tree and forest factors of Poisson point 
processes in R d , and Holroyd and Peres [12], Timar [23] have studied tree and forest factors of 
general point process in R d . Also, factors of one-dimensional Poisson processes that commute with 
discrete shifts (i.e. as in Theorem 1 (d), above) are one of the subjects studied in [11]. 

As a byproduct of the proof of Theorem 1, we will also obtain the following theorem, which is 
a "PWIT analogue" of [2], Theorem 1.2. 

Theorem 2. T (Vn) 4 T nc as n — > oo. 

Before stating our third theorem (in fact, the first two theorems lean heavily on tools introduced 
in proving the third), we have a few more concepts to introduce. For each edge e of 7nc> let 
X e ~ Uniform[0, 1), independently of all other edges. Let eo = vqV\, e\ — v\V2, ... be the edges 
of the unique infinite path (the backbone) in T\\q, let M = 0, and for integers i > 1, let Mj = 
max <j<; X e .. Now let 7^ c be the subtree of Tnc obtained as follows. Let v be a vertex of Tnc, 
and let Vi be the nearest vertex of the backbone to u. If any edge of the path from v to Vi has 
weight greater than Mi, then remove v from the tree. Do this for each v £ Tnc- Finally, remove 
vq and root at v±. The resulting subtree of Tie is T^ c . 

Theorem 3. There is a continuous, strictly decreasing bijective map q : [l,oo) — > (0,1] such 
that (q(Wi), q(Wi), . . .) = (Mi, M2, . . .), in the sense of finite dimensional distributions. Fur- 
thermore, 7q conditional on (W\,W2,- ■ ■) is distributed as T^ lc conditional on (Mi,M2,...) = 
(q(Wx), qiWi), ■ ■ ■), in the local weak sense. 

It is worth mentioning that the Markovian nature of (W\, W2, ■ ■ ■) can be immediately deduced 
from this theorem. Given Mi, Mi + i is greater than Mi precisely if X ei+1 £ (Mi, 1], in which case 
Mj+i = X ei+1 . Thus, given Mj, Mj+i is equal to Mj with probability (1 — Mi), and otherwise is 
uniform on (Mi, 1]. Translating this to Wi immediately yields the "PWIT analogue" of the Markov 
process construction [2, Proposition 3.1.]. 

1.1 The PWIT asaa^co limit of K a+l or of T a 

We mention in passing that with not much effort, it is possible to prove convergence of invasion 
percolation on T a or on K a+ i to invasion percolation on the PWIT, in a stronger sense than the 
local weak sense. Let U = (Ul , E/J, • • • , U*) be the order statistics of a independent Uniform[0, a] 
random variables. Then U tends weakly to the vector of points of a homogeneous rate one Poisson 
process V on [0,oo). More importantly for our current purpose, the vector (U\, . . . ,U\^\) has 
total variation distance 0(a~ 1 ' 2 ) from the vector of the first [-y/crj points of V . It follows, in a 
sense that can easily be made precise, that the first o(s/a) steps of invasion percolation on T a 
together have total variation distance o(l) from the same number of steps of invasion percolation 
on the PWIT. A similar statement holds for the first o(y / a) steps of invasion percolation on K a+ i. 
This in particular yields new proofs the explicit error bounds derived in [17] for the behavior of 
the early stages of Prim's algorithm on K a +i- The details are straightforward and we leave them 
to the interested reader. 
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1.2 Outline 



In Section 2 we construct the building blocks on which the remainder of the article rests. In 
particular, we describe a different way to view invasion percolation, in terms of a "note-taking" 
procedure that accompanies the invasion percolation procedure, and in the special case of invasion 
percolation on trees contains all the information required to reconstruct the original procedure. To 
best understand this note-taking procedure we introduce the "box process" (Definition 6), which 
gives us a clear picture of the connection mechanism of invasion percolation. The box process also 
allows for an understanding of a related "two-way infinite" invasion percolation process, which can 
be seen as describing the behaviour of invasion percolation far from the root. Furthermore, with 
the introduction of the "box graph" in Section 2.2, the box process itself becomes an interesting 
object of study, and we derive some of its fundamental properties. Throughout Section 2, our 
studies are in the deterministic setting. 

In Section 3 we apply our tools to study 7q. In particular, we prove the PWIT analogue of 
the forward maximal representation of T in more detail. Section 3 also contains some results 
concerning ballot style theorems, queueing processes and Poisson Galton- Watson duality that are 
of use in proving Theorems 1 through 3. 

Finally, in Section 4 we prove a number of results concerning the box graph and the stationary 
process. In particular, we find that these graphs resemble the Poisson IIC locally. Using these 
results, we deduce Theorems 1 through 3. 



2 Redrawing invasion percolation 

In this section we describe a different way to view invasion percolation which is at the heart of 
most of the results of this paper. First, imagine keeping notes of the local edge landscape we see as 
we perform invasion percolation, as follows. At step i of invasion percolation, we explore vertex Vi 
and record the weights of all edges leaving Vi and heading into new territory by putting marks on 
the vertical half-line {i} x [0, oo) whose heights are the weights of these edges. (When performing 
invasion percolation on a rooted tree T, the edges "heading into new territory" are precisely the 
edges from vi to its children in T.) Running the invasion percolation process until it terminates 
(or forever) then yields some set P of points in the positive quadrant. 

Formally, suppose G = (V, E) is a weighted graph with all edge weights distinct, and with 
distinguished vertex vq. Then the invasion percolation procedure defines an infinite subtree T of 
G, with vertex set {v ,v\, . . .}. For each i > 0, let pi(l),pi(2), . . . ,Pi(ji) be the weights of the 
edges from v t to V \ {vo, ■ ■ ■ in increasing order of weight. Let P l = P l {G) = {(i,Pi(j))}'jLi, 

and let P = P(G) = U^j^P*. 

In general, it is not possible to reconstruct the steps taken by invasion percolation by considering 
only the set P. However, this is possible for invasion percolation on trees, and we now explain 
how. In order to do so, we introduce an inductive procedure for building a tree, given a set of 
points PcR 2 and an interval X C Z of consecutive integers. We write n% = inf{n 6 X} > — oo 
and mj = sup{n el} < oo. 

For notational convenience, given X C IR 2 , we write \X\p for |PflX| . Also, for a point p € M 2 , 
we write x(p) for the x-coordinate and y(p) for the y-coordinate. Let us assume the following. 

1. All points of P lie in the upper half-plane. No bounded set contains unboundedly many 
points. 

2. For any b£I, there exists k > with n — k > nj — 1 for which |[n — k, n) x [0, oo)|p > k. 

3. |P((-oo, oo) x {y})\ < 1 for any y e K. 
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Figure 1: Top, an Z-reasonable set of points P, with X={0,...,13}, and the corresponding boxes 
(defined in Definition 6). Middle, the tree IPC(P, I). Bottom, the forest BG(P,Z), defined at the 
start of Section 2.2. All arrows point from child to parent. 



If P satisfies these three conditions, we say it is reasonable (or X-reasonable, if X is not clear from 
context). (Here, as well as later, we state deterministic requirements for the pointset P; these 
requirements - and therefore, the results derived from them - will hold almost surely for all the 
random point sets we consider. In particular, the reader will always be safe thinking of P as a 
Poisson point set of intensity one in the upper half-plane.) We start from an empty set P„ x = 0, 
from which we will build an increasing sequence of subsets of TVo- The following procedure requires 
nx > — oo. 

For each i = nx, n% + 1, . . . , nix, 

1. Let p i+ i = p i+ i(P,T) be the point of ([nj,i + 1) x [0, oo)) n (P \ Pi) minimizing 

y{pi+i)- 

2. Let P+i = P+i(P,X) = P U {p l+ i}, and let e l+1 = e l+1 (P,I) = (i + 1, [x(p l+1 )\). 

We refer to this procedure as point set invasion percolation. Since P is reasonable, the procedure 
is well-defined. The resulting graph IPC(P, I) has vertex set I and edge set {e^ : n% < i < mj+1}. 
(We write i < mj + 1 instead of i < mj since we may have mj = oo, but i is always finite.) An 
example is shown in Figure 1. Note that IPC(P, I) is a tree, which we view as rooted at nx- We 
often also view IPC(P,2~) as a weighted tree in which edge has weight y(pi). In general in this 
section we work in the deterministic setting. However, since our eventual aim is to link this work 
to invasion percolation on randomly weighted trees we briefly discuss how this can be done. 

IPC of the PWIT Now suppose that T is an instance of the PWIT, and let To be the subtree 
of T explored by invasion percolation. The following lemma is then immediate. 

Lemma 4. IPC(P(T),N) and Tq are identical, and for each i € N, w(ei) = y(pi). 

When performing invasion percolation on T, for all i, P l {T) is a Poisson point process of rate 
1 on the vertical half-line {i} x [0, oo), and P(T) is the union of these point processes. 

We remark that since all points in P have integer x-coordinates, the floor in step 2, above, has 
no effect. The use of the floor is to ensure that if a point p = (x,y) S P is replaced by a point 
p' = (x',y), as long as [^J = L^'J' the resulting graph IPC(P,2~) will be unchanged. As a result 
we obtain the following corollary. 

Corollary 5. Let V be a Poisson point process of rate 1 on [0,oo) x [0,oo). Then IPC(P,N) and 
7q are identically distributed. 
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Proof. Associate to each point p = (x(p),y{p)) of P(T) an independent uniform U p , and let p' 
be the point (x(p) + U p ,y(p)). Then P' = {p' : p 6 P} is a Poisson point process of rate 1 on 
[0,oo) x [0,oo), and IPC(V,N) and IPC(P,N) are identical. The result follows. □ 

This corollary reduces the study of the distributional properties of To to that of the distributional 
properties of 1PC(V, N), where V is a Poisson point process of rate 1 on [0, oo) x [0, oo). 

We also demonstrate how the two examples of invasion percolation described in Section 1 can 
be encoded by suitable point processes. 

IPC of an infinite randomly weighted er-regular tree. Let T a be the rooted regular tree 
with forward degree a > 2. We can model invasion percolation on X a as follows: for each nel = N, 
choose a independent, uniformly random points of [n, n + 1) x [0, a) (or of {n} x [0, a)). Let P be 
the union of all these points. 

The minimum spanning tree of the complete graph. Let K a +\ be the complete graph on 
a + 1 vertices. We may approximately model invasion percolation on a randomly weighted K a +i 
as follows: for each n 6 X = {0, . . . , a}, choose a ~n points independent, uniformly random points 
from the set [n, n + 1) X [0, a). Let P be the union of all these points. 

This representation is not exact due to the cycles in K n . For example, it is possible that the 
second least weight leaving the starting vertex is on the edge between the second and third vertices 
visited by Prim's algorithm. However, the probability of events of this type is asymptotically 
negligible for the first o(^fa) steps of the algorithm. 

The acyclicity of trees is what allows us to model them by a point process without reference to 
the order of exploration of vertices. In general - for invasion percolation on Z d , for example - it 
may still be possible to use some of the following methodology while jointly constructing the point 
process P and the exploration process "as we go" . However, we have not pursued this avenue of 
study. 

For the remainder of the section, we explore what properties we can derive about the point 
process invasion percolation procedure with as few restrictions on the point set P as possible. The 
next definitions and lemma provide an alternative geometric characterization of the connection 
rule used in the above inductive procedure, one that will be useful throughout the paper. 

Definition 6. Given an interval!, with n% > — oo, and an J -reasonable pointset P, for each iel 
with i > nx> let 

hi(P,X) = inf{/i : 3j eX,nj <j<i such that \[j,i) x [0, > i — j}. 

Let ti(P,I) be the minimum integer li S [nx,i) such that |[£j,i) x [0, hp t z(i)]\p = i — li, let 
Bi{P,X) = [li, i) x [0, hi], and let ti(P,I) be the unique point in P>i with y(ti) = hi. 

We often omit reference to the parameters P and I if the context is clear. 

We take a moment to observe that these functions are well-defined. It follows from condition 2 
that hi is finite, and from condition 1 that it is positive. The minimality of hi then implies the 
existence of a point p 6 P(Bi) such that y(p) — hi. The fact there is a unique such point follows 
from condition 3. 

Lemma 7. If nj > — oo and P is X-reasonable, then for all n £ X \ {ni}, we have t n = p n . 

Proof. It suffices to show (by condition 3) that y(t n ) = y(p n ). We prove this by induction on 
n. Clearly, the assertion holds for n — ni + 1. Assume n > nj + 1 and that U = pi for all 
rix + 1 < i < n. First, since |i? n |p = n — £ n and U™~ nx+1 pi contains most n — £ n — 1 points of 
P n B n , the set (P \ P n -i) H B n contains at least one point and so y(p n ) < y(t n )- 
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To show that y(t n ) < y(p n ), first note that if x(p n ) > n — 1 then \[n — l,n) x [0,j/(p n )]|p > 1 
and so certainly y(t n ) < y(p n )- We thus assume that x(p n ) < n — 1, and construct a sequence 
{ a i}i=o inductively as follows: 

Let i = and let ao = n — 1. 

1. If a, < a;(p n ), set fe = z and stop. 

2. Otherwise, let a^+i = £ ai , then let i = i + 1 and return to 1. 

For each < i < k for which a is defined, if > x(p n ) then y(p ai ) < y(Pn) or else the 
point p„ was a better choice for p ai . By the inductive hypothesis, p ai = t ai . By construction, 
I-BqJp — o-i — £ ai = a% — aj+i for all i < k. Since B ai n i? aj = for all i =/= j, we conclude that 

^iZo B a z hasn-l-a fe points of P. Thus, | [a k , n) x [0, y(p„)]|p > \{p n } U B a< If ^ n ~ a fe- 
By the choice of h n minimum, it follows that h n = y{t n ) < y{p n ) as required. □ 

The structure of the containment relations among the boxes Bi turns out to be interesting in 
its own right, and we explore aspects of it here and later in the paper. 

Lemma 8. If nx > —oo and P is I-reasonable, then for n € X\ {nx}, either he n > h n or £ n = nj. 

Proof. Assume t n ^ nj, suppose he n < h n , and write m = i n . Then both B m and B n are contained 
in [£ m , n) x [0, h n ], so | [£ m , n) x [0, h n ]\p > n — t m . This contradicts either the choice of h n or the 
choice of i n . □ 

Lemma 9. If nx > — oo and P is X-reasonable, then for any i,j e I \ {n-i} with i < j, either 
Bi n Bj r = or Bi C Bj. 

Proof. Suppose that Bi D Bj ^ <fi, in particular this implies lj < i. We prove that Bi C Bj by 
proving that hi < hj and ij < l\. 

The minimality of hj implies that \[£j, j) x [0, hj)\p = j — lj — 1 and that |[z, j) x [0, hj)\p < 
j — i — 1. Thus |[^, «) x [0,/ij)| >i — £j, which immediately implies that hi < hj. 

We now prove lj < ^j. Suppose that £, < Then, by reasoning as above, and using the fact 
that hj > hi we have that \[£i,£j) x [0,hj)\p > \[£i,£ 3 ) x [0,/ij)|p > £j — £j. This implies that 
|[^ij i) x [0) ^')|p ^ J ~ ^i) which contradicts the definition of lj. □ 

Lemma 10. If nx > —oo and P is I-reasonable, then for any ?,j £ I \ {nx} such that lj < i < j , 
there is a path in IPC(P,X) between i and lj. 

Proof. Observe that since lj < i, Bi C Bj by Lemma 9. We apply induction on i — lj. If i — £j = 1 
then we must have |_£(pi)J = lj, so a — (i,£j), verifying the claim. 

For larger values of i — lj, first note that since Bi C Bj, we must have lj < li < [x(pi)\ < i. If 
lj = L x (Pi)J then Bi is a path from i to lj. Otherwise, [a;(Pi)J — lj < i — lj, so by induction there 
is a path from [^(Pi)] to lj, which together with edge yields a path from i to lj. □ 

2.1 Point process invasion percolation in the upper half-plane 

For suitable pointsets P, we may hope to define a version of the invasion percolation procedure in 
which X = Z (or more generally when nx = —oo). This is indeed possible and the resulting infinite 
graph can be said to capture the behavior of invasion percolation "very far from the root." A direct 
inductive description of the graph seems difficult, and so we define the object IPC(P, Z) as the limit 
of IPC(P, Z n [to, oo)) as to — > — oo. Later, we shall also see how the alternative characterization 
of the connection rule given by Definition 6 and Lemma 7 can be used to define this extension of 
the invasion percolation procedure. 
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As before, we desire as few restrictions on P as possible. In this section, we suppose we are 
given a set of points P C K 2 and an interval X with riz > — oo and mj < oo. We say that P 
is seemly (or X-seemly, if X is not clear from context) if P satisfies conditions 1 through 3 and 
additionally either (a) nj > — oo, or (b) nx = — oo and P satisfies conditions 4 and 5, below. 

4. For any n£l, there are infinitely many m G In (— oo, n) such that | [to, n.) x [0, 1] \p > n — m. 

5. If A < 1 then for any n G X, there are at most finitely many m G X n (— oo,n) such that 
|[m, n) x [0, A] |p > n — m. 

The reader can verify that the following two examples almost surely produce seemly point sets. 

Stationary limit of IPC on T a . Let P be defined by choosing a independent, uniformly random 
points in the set [n, n + 1) X [0, a) for each n G X = Z. 

Stationary limit of the Poisson IPC. Let P be a Poisson point process of intensity 1 in the 
upper half plane and let X = Z. 

The following lemma essentially states that for X-seemly pointsets with nj = — oo, all edges 
have weight less than 1. 

Lemma 11. If nj = — oo and P is X-seemly, then for any n G X there exists m G X such that 
ft„(P,ln [to, oo)) < 1 for all integers m < Too- 

Proof. By condition 4, |[m, n) x [0,1] |p > n — m for infinitely many integers m < n; therefore, 
ft, n (P,Xn [m, oo)) < 1 for infinitely many integers m < n. But h n (P,X (~l [m, oo)) = y(p n (P,X D 
[m,oo))) and y(p n (P,X D [m, oo))) is non-increasing as m decreases, so by condition 3 y(p n (P,X D 
[m, oo))) < 1 for all m small enough. □ 

We next consider the family of intervals X n [m, oo) for m G Z, and show that as to — * — oo, 
each vertex only changes its parent a finite number of times. This allows us to consistently define 
the limiting object IPC(P,X). 

Lemma 12. If P is X-seemly, then for any n G X, there exists m > — oo such that p n (P,X D 
[to, oo)) = p n (P,X H [too, oo)) for all to G X n (— oo, too]. 

Proof. The lemma is obvious if nj > — oo so assume m = — oo. Fix n G X and suppose the assertion 
of the lemma fails for this n. Then there exists a strictly decreasing integer sequence {to^I^q and 
a sequence {ftli^o °f distinct points in P such that p n (P,X C\ [mi, oo)) = for all ieN, whose y- 
coordinates decrease strictly as i increases. By Lemma 11, there exists some io such that y(qi) < 1 
for all i > i . But then for all i > i , B n (P,X n [mj,oo)) C [^ n (P,Xn [m i7 oo)),n) x [0,y(gj)], and 
so for such i, 

\[£ n (P,Xn [TO,,oo)),n) x [0,y(qi)]\ P > \B n {P,X n [mi, oo))| P > n - f»(P, ^ n K, oo)). 
This is a contradiction to condition 5. □ 

For a seemly pointset P, we now define IPC(P,X) to be the graph with vertex set X and such 
that for each n G X, e n = e„(P,X) = lim m ^_oo e n (P,X fl [to, oo)). This limit is well defined by 
the preceding lemma. We likewise define p n (P,X), £ n (P,X), h n (P,X), and B n (P,X). By a limiting 
argument Lemmas 8, 9 and 10 are also valid with respect to IPC(P, X) when nj = — oo. We 
therefore obtain the following theorem. 

Theorem 13. If P is X-seemly, then IPC(P, X) is a tree. 
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Proof. Since it is clearly acyclic, we just need to show that IPC(P, X) is connected. Suppose i,j el, 
i < j. Let l°- = £j and for t > 1, t € N, let = ^t-i. Then there must exist t € N such that 

i e i?£t . By Lemma 10, there is a path between j and £* +1 and there is a path between i and £*- + . 
As « and j were arbitrary, this completes the proof. □ 

An advantage of the current formulation of invasion percolation is that we can equivalently 
define the limit process via conditions on the numbers of points in boxes [to, n) x [0,h]. More 
precisely, the following lemma is easily verified. 

Lemma 14. Suppose P is X-reasonable. Fix k, n with nj < k < mj + 1 and < n < {k — m) + 1> 
and y > 0. In order that £k = k — n and that h — y, it is necessary and sufficient that the following 
three conditions hold. 

• | [k — n, k) x [Q, y] \p = n and \ [k — n, k) x [Q, y) \-p = n — 1 ( call this condition E — E(k — 
n,k,y,P)). 

• For all < m < n, \[k — to, k) x [0, y) \p < m ( call this condition F — F(k — n, k, y, P)). 

• For all m G N, | [fc — n — to, fc — n) x [0, y]\-p < m ( call this condition G — G{k — n,y, P)). 

In this case, = [k — n,k] x [0,y], p n is the unique point p € P with y(p) — y, and e n = 
(n, |x(Pn)J)- 

We will sometimes have use for the condition G(k — n, y~), which is the same as the condition G 
above but with [0, y] replaced by [0, y). The next lemma provides a condition under which we can 
determine the behavior to the right of a given integer n without further reference to the behavior 
of P to the left of n. Its proof is obvious and is omitted. 

Lemma 15. Suppose P is X -reasonable. Fix nj < n < mx+l and y > 0, let Q = {p S P : x(j>) > 
n i yip) 5: y}, and let J — In {n, . . . , oo}. If G(n, y~~) holds and Q is J -reasonable, then for all m 
with n < to < TOz+ 1, £m(P,T) = £ m (Q, J), h m (P,l) = h m (Q,J), and p n (P,X) = p n (Q,J)- 

We will also have use of the following sufficient condition for Q to be reasonable. (Again, the 
proof is straightforward and is omitted.) 

Lemma 16. Let I, P, k, n, and y be as in Lemma 14, let J — {k — n, . . . , k} and let Q = Pfl ([fc — 
n, k] x [0, y\). If E, F, and G all hold, then Q = P n Bk and Q is J -reasonable. 

2.2 Box graphs 

As we saw above, the boxes B n play a useful role in our study of invasion percolation. The boxes 
can also be seen to capture information about the structure of the point process invasion percolation 
procedure itself. For example, it is easily checked that if the procedure explores some edge e lying 
within a box B n , then it will explore all other edges lying within B n before exploring any edges 
with an endpoint outside of B n . (Of course, the procedural interpretation does not exist when 
T = Z, but in this case we can still think of the boxes as capturing information about the process 
behavior "far from the root".) 

In this section, we introduce a graph which characterizes the containment relation among the 
boxes. Given an X-reasonable pointset P, we define BG(P,X) to be the graph with vertex set 
X\{riz,+oo} and such that, fori < j, i and j are joined by an edge if and only if Pi(P,X) C Bj(P,X) 
and Pj(P,X) % Bj'(P,I) for any i < j' < j. Also, for i G X\ {nj, mj}, we write Oj(P,X) for the 
parent of i in BG(P, X). 

The examples shown in Figure 2 demonstrate that between the graphs IPC(P, X) and BG(P, X), 
neither is determined by the other. (Theorem 1(c) is essentially a consequence of this fact.) 
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Figure 2: The leftmost and middle sets of points have the same BG graphs but different IPC graphs. 
The middle and rightmost sets of points have the same IPC graph but different BG graphs. 



Clearly, BG(P, I) is acyclic for any X. We shall show that BG(P, Z) is a tree (i.e., connected) 
under the additional assumption of the "rightward version" of condition 5. 

6. If A < 1 then for any m £ I, there are at most finitely many n £ Z n (m, oo) such that 
|[m, n) x [0, A] |p > n — to. 

If P satisfies conditions 1 through 6 with T = Z, we say that P is exemplary. Both examples 
of the last subsection are almost surely exemplary pointsets. 

Lemma 17. Suppose mj = oo and P is an I -reasonable pointset that satisfies condition 6. Choose 
any to £ X \ {nx,+oo} for which h m < 1 and for which there is no m! £ Z n (m, oo) such that 
B m C B' m and h' m > 1. Then there are infinitely many n £ Z n (m, oo) suc/i £/ia£ P TO C B n . 

Proof. Suppose to is as in the statement of the lemma but that there are only finitely many n > m 
such that B m C B n . Then by replacing B m with the tallest box that contains it, we may assume 
that in fact there is no n > to such that B m C B n . By condition 6, we may choose n > m for 
which | [m, n) x [0, /i m ]|p < n — to. Thus, there must be i £ {to + 1, . . . , n} for which hi > h m , so 
take i minimum such that this holds. By Lemma 8, we must then have < to, and so by Lemma 9 
we must have B m C Bi, a contradiction. □ 

Before showing that BG(P, Z) is a tree, let us first use the lemma to confirm the basic property 
of exemplary pointsets that every point of P under the line y = 1 lies along the top of some box 
B n . 

Proposition 18. If P is exemplary, then for all p € P PI [nj, +oo) x [0, 1), we have p — p n for 
some n &T. 

Proof. Let p S P have y(p) < 1. We first note that if p S B m for some to, then p = p n for 
some [~£e(p)] < n < to. Also, there must be some integer k < x(p) for which hk > yip), or else 
|[[x(p)] — i, \x(p)~\] x [0,y(p)]|p > i for infinitely many integers i > 0, which contradicts condition 
5. 

By Lemma 11, h m < 1 for all to G Z, so by Lemma 17, there are infinitely many to £ Z for 
which Bk C S m . One of these boxes contains p, so p — p n for some n, as claimed. □ 

Theorem 19. J/P is exemplary, then BG(P, Z) is a tree. 

Proof. It suffices to show that BG(P, Z) is connected. Recall that h n < 1 for any n £ Z, by 
Lemma 11. Fix i < j, i,j £ Z. By Lemma 17 there are infinitely many to such that Bi C P m . 
Take the least such to for which m > i - then also P, C B m , and so by Lemma 10 there exist 
paths from i to to and from j to to. The theorem follows. □ 
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2.3 Random walks and the forward maximal process. 

Let P be a point set satisfying condition 1. Given h > and k G T, we define random walks 
gk.h = S k,h( P j and L k,h = L k,h(pj ag f n ows . We set S ^ h = L ^,h = q andj for ■ > 1; get 

S*' h = \[k,k+i] x [0, h]\p — i, and set L i ' — \ [k, k — i] x [0, h]\p — i. We also define random walks 
S k ' h and L k ' h , by replacing [0, h] by [0, h) in the above definitions. In other words, the random 
walks S k ' h and L k ' h ignore points on the line y = h. (For fixed h, for any of the random point 

sets P we will consider, it will be the case that with probability 1, S i ' — S^' h for all i, but we 
will at times work in conditional settings in which these two random walks are not identical.) 

We say that S k ' h survives if for all i > 0, S^' h > 0, and otherwise say that S k ' h dies. Also, 
say that S k,h has a chance if > for some i > 0, and otherwise that S k ' h has no chance. We 
extend these definitions to L k ' h by symmetry. 

We now establish two more basic properties of IPC^T^Z), under the following additional as- 
sumptions. 

7. If A > 1 then for any m G Z, S 1 ™' < n for at most finitely many n € N. 

8. S ,fe ' 1 dies for all k G X. 

Roughly speaking, condition 7 is a "rightward version" of condition 4. If P is an X-reasonable 
pointset that satisfies conditions 6, 7 and 8, we say that P is distinguished (or 2 -distinguished, if 
X is not clear from context). The first two examples given in the introduction to this section are 
almost surely distinguished pointsets. 

We will see that for distinguished pointsets P, when n% > — oo and mj = oo, BG(P, I) is not 
connected - in this case we call the connected components the ponds of BG(P, I). We will see later 
that this agrees with the normal use of this term in the invasion percolation literature. 

Lemma 20. If nj > — oo, rrix = oo and P is X -distinguished, then for any m € I\ {nj,+oo}, if 
h m > 1 then there are at most finitely many n £Z, n > m such that B m C B n . 

Proof. Suppose otherwise. Without loss of generality, we may assume that h m > 1. Consider the 
integer sequence {n,i}°^ , which is defined as follows. Let n — m. For i 6 N, let n i+ i be the 
smallest integer greater than m such that B ni C B ni+1 . Then for any i £ N and all rij < n < 7i;+i, 
we have B n C [m, rij+i) x [0, h ni ] for all rij < n < Ui+\ by Lemmas 8 and 9. Furthermore, it follows 
from the definition of B n and Lemma 9 that \([rii, rij+i) x [0, h ni+1 )) \ (U„ i<n <„ i+1 i? n )|p = (or 
otherwise there would be a smaller choice for h ni+1 ). Thus, |[nj,nj+i) x [0,h ni ])p = | U„ i<n <„ i+1 
B n \p = n i+ i — rii — 1. Since h no < h n . for alH > 0, it follows that \[n , n i+1 ) x [0, /i no ]l-P < — n o 
for all i > 0. Since /i„ n > 1, this is a contradiction to condition 7. □ 

Theorem 21. I/nj > — oo, mj = oo and P is X -distinguished, then BG(P, T) contains infinitely 
many components, all of which are finite. Furthermore, for any given component, if n is the 
rightmost integer belonging to the component, then h n > 1 and the set of vertices of the component 
is {l n + 1, .. . , n}. 

Proof. Let P satisfy the hypothesis of the theorem. We construct a sequence of integers {n i }°l 
as follows. Let no = nj. For i E N, let rij+i be the largest integer greater than m such that B n . +1 
contains the point (rii,0). We must now show this sequence is well-defined. Suppose not, and let 
i be minimum such that there is no valid choice for rij+i. Then there are infinitely many integers 
n > rii such that B ni C B n . Since i was chosen minimum, for each such n we have l n = n». By 
Lemma 20, it must be that for each such n, /i n < 1. But this implies that |[rij,n) x [0, l]\p > n — rii 
for all n G Z n [n^oo), a contradiction to condition 8. Thus the sequence {rii}°^L is well-defined. 

For all i G N, it follows from the definition of r^ + i that £ ni+1 = ft-i, and there is no integer 
n > rii + i for which B ni+1 C B n ; thus and B ni+2 are in separate components of BG(P, I). By 
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Lemma 9, all B n such that n$ < n < rij+i are contained in B ni+1 and hence in the same component 
of BG(P,X). 

If h ni+1 < 1 for some % € N, then by Lemma 17, there arc infinitely many n£Zfl (rij+i,oo) 
such that B n . +1 C P„, but this is a contradiction to the choice of By Lemma 8, we have for 

all i € N that /i„ i+1 > h n . +2 . We conclude that h n > 1 for all i e N. □ 

Given an interval X, with nj > — oo and mj = oo, and an X-distinguishcd pointset P, define 
{rii}°l Q — {ni(P,T)}°l Q as in the proof of Theorem 21. 

Corollary 22. If m > — oo, mj = oo and P is 2 -distinguished, then IPC(P,X) is a tree that 
consists of a unique infinite backbone (i.e., a unique, infinite, self-avoiding path originating from 
the root) from which emerge finite branches. Furthermore, the backbone contains the points {ni}°l Q . 

Proof. Clearly, IPC(P,X) is acyclic, and is connected by Lemma 10, so is a tree. For each integer 
i > 1, let $j be the unique path from m to no = n% in IPC(P, X). By Lemma 10 it follows that for 
all i > 1, &i is a sub-path of and so the limit $ = lim.^oo <j>j is a well-defined infinite path 

starting from nj. Furthermore, for any integer fc, any path starting from that is edge-disjoint 
from $ must have all its elements among n,, . . . , n,j+x — 1, where ni < < ni+\. Thus, all branches 
leaving $ are finite. □ 

Remark. In general, relaxing any of the conditions in the definition of distinguished pointsets 
may cause the conclusions of Corollary 22 to fail. To provide, just one example, the following 
pointset satisfies conditions 1 through 7, but not the conclusion of Corollary 22. The pointset P 
contains no points except the following. Place k points inside [0,1) x [0,1/3). Place each of the 
points (i + 1, 1 - l/(i + 2)) for % e N. Then IPC(P, N) has k infinite backbones. 

This completes our study of deterministic properties of the invasion percolation procedure. In 
the next section we begin our study of what happens when the underlying point set is random. 

3 Invasion percolation on the PWIT 

Throughout Section 3, V denotes a Poisson process of constant intensity 1 in [0,oo) x [0, oo), 
so V is almost surely N-distinguishcd. By Corollary 5, IPC(P,N) is distributed as the invasion 
percolation cluster T of the PWIT, so results for IPC(P, N) apply to T mutatis mutandis. Below, 
we will derive more precise statements about the structure of IPC(P,N) than can be made under 
the assumptions of Section 2. First, however, we state two "ballot-style" theorems for stochastic 
processes that we will use repeatedly. 

3.1 Two ballot-style theorems 

The following result was proved independently by Tanner [22] and Dwass [8]. 

Lemma 23 (Cycle lemma). Suppose that X±, . . . , X n are integer-valued, cyclically interchangeable 
random variables with maximum value 1. Then for any integer < k < n, 

P{5,>0Vl<!<n \S n = k} = -. 

n 

The next result was proved by Takacs [21, page 12]. 

Lemma 24 (Stationary ballot theorem). Let Xi, X 2 , ■ ■ ■ be an infinite sequence of i.i.d. integer 
random variables with mean fi and maximum value 1 and for any i > 1, let Si = Xi + . . . + X,-. 
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Figure 3: The region on the left maps onto the region on the right when P is sent to P l 



The 



P{5„ >0Vn = l,2, . 



fi if fj, > 
if n < 0. 



Now let P be a random pointset in, say, [m, n] x [0, oo). We recall the definition of the condition 
F(m, n, y) from Lemma 14, and will abuse notation by also writing F(m, n, y, P) for the event that 
the condition F(m, n, y, P) holds. (At times we write F{m, n, y) in place of F{m, n, y, P), when P is 
clear from context.) Notice that if P is a uniform set of n— 1 points in [0, n] x [0, A), for some A > 0, 
then applying the cycle lemma with X t — 1 — — x [0, A)|p (and so Si = i — \[0,i) x [0,A)|p) 
for i — l,...,n, it follows that the probability that F(0,n, X, P) occurs is precisely 1/n. By an 
argument of a similar nature, we can straightforwardly derive the following lemma (which can also 
be deduced from an existing result [17, Theorem 4] for invasion percolation on K n and a limiting 
argument). 

Lemma 25. Fix an integer n > 1 and list the n elements ofPD ([0, n] x [0, oo)) of lowest height 
as q\, . . . , q n , in increasing order of height. Then for each i — 1, . . . , n, P {p n = qi} = 1 jn. 

We emphasize that the n elements of TT)([0, n] x [0, oo)) of lowest height may not all be elements 
of the set {pi, . . . ,p n }, or indeed of the set {pi}^. 

Proof of Lemma 25. Fix n and let A = y(q n ). Clearly, p n will be among qi,...,q n . Also, let 
1 = {0, . . . , n}, let P = {q\, . . . , q n }, and for each i = 0, . . . , n — 1, let P l — {q\, . . . , q % n } be the 
cyclic shift of P to the right by distance i. Then for all i — 1, . . . , n — 1, P l is distributed as n — 1 
uniform points in [0, n] x [0, A), together with a single uniform point of height A. We claim that 
with probability 1, for each j = 1, . . . , n, there is exactly one i = i(j, P) € {0, . . . , n — 1} for which 
p n (P l ,T) — q % -. Since the P 1 are identically distributed it follows from this claim that 



P { Pn (P,T) = qj }=-J2P { P n(P\l) = q}} = -P { |J {Pn(P\l) 



i=0 



. i=0 



1 

n 



which proves the theorem. It thus remains to prove the above claim, which we do by contradiction. 
Thus, suppose that for some j S {l,...,n}, there are distinct i, i' € {0, ...,n— 1} for which 
p n (P l ,T) — qj and p n (P % ,1) — q) ■ By replacing P by either P n ^ 1 or P n ~ l if necessary, we 
may assume that i' = 0. Let qj = g° = (xj,yj). We must have \[n — i,n] x [0,2/j)|p < i (or else 
p n (P,J) ^= qj); on the other hand, \[£ n (P,l),n] x [0, y 3 ]\p = n — £ n (P,l). 

Let k = 7i — ^„(P, I), the length of B n (P, X). If k > i then we also have | [£ n , n — i] x [0, yj]\p > 
n — £ n — i + 1, so |[£ n + i, n] x [0, yj]|p< > n — £ n — i+l and h n (P l ,T) < yj, contradicting the fact 
that p n (P l ,T) — q % y It follows that k < i, i.e., that i — (n — £ n ) > 1. In this case, we have that 
for each m = 1, . . . , i — (n — £ n ), [£ n — m, £ n ] x [0, yj] < m (or else we would have either chosen h n 
lower or £ n smaller). Translating the above information to P l , we see that \[i — k,i] x [0, Vj]\pi = k, 
that \[i — k, i] x [0, t/j)|p> < fe, and that \[i — k' , i) x [0, yj]\pt < k' for each k' € {fc + 1, . . . , ?} (see 
Figure 3). Thus, by Definition 6 and Lemma 7, pi(P t 1 T) = g*, contradicting the assumption that 
p n {P\l)=q). 

□ 
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We next elaborate on a connection between Poisson Galton- Watson trees and qucucing theory 
that will be useful for many subsequent calculations. 

3.2 A fact from queuing theory and an aside on Poisson Galton- Watson 
duality 

The following basic result was first noted by Borel [4]. Consider a queue with Poisson rate A 
arrivals and constant, unit service time, started at time zero with a single customer in the queue, 
and with any arbitrary servicing rule (i.e., not necessarily first-in first-out). We may form a rooted 
tree associated with the queue process run until the first time r that there are no customers in the 
queue (or forever, if the queue is never empty), in the following manner. If a new customer joins 
the queue at time t, he is joined to the customer being served at time t. We denote the resulting 
rooted tree by T. Then T is distributed as a Poisson(A) Galton- Watson tree (we write PGW(A), 
for short) [4]. 

If the arrival times are given by the ^-coordinates of the points of Poisson process Q = V n 
([0, oo ) x [0, A)), we may also associate an interpolated random walk to the process, by St = 
\[0,t) x [0, \)\ Q - t, for t £ R + . Then |T| is simply the first time t that S t = -1, i.e., that 
| [0, t) x [0, A) | q = t — 1. Note that given that | T\ = m < oo, Q is distributed as m — 1 independent 
uniform points in [0, t) x [0, A], conditioned on F(0, n, A, Q) occurring. We also observe that for all 

i < Si = Sf' A , where S°' X is the random walk defined in Section 2.3. This has immediate 
implications for the events defined in Section 2.3. In particular, the tree T is infinite if and only if 

survives. It follows that for all < h < 1 we have P |s ,fe,A survives | = 0, and for all A > 1 

we have P |s ,fe,A survives j = 0(A), the probability of survival of a PGW(A) branching process. 
Similarly, by Lemma 24, we have that the probability that there is ever a time t at which the total 
number of arrivals is at least t, is min(A, 1). Thus, if < A < 1 then P |5 fe ' A has a chancej = A, 

and if A > 1 then P |s lfc-A has a chancej = 1. Of course, the exact same identities hold with 
gk,\~ re pi ace d by S h ' X , L k > x ~ , or L fe ' A . 

We continue to think of arrival times as given by points of Q. It will be useful for us to view 
the above queuing procedure as creating a tree whose nodes arc labelled by integers rather than 
by the elements of the queue. We do so by re-labelling each node of T (i.e., each customer c) 
with the (integer) time at which c begins being served. Furthermore, suppose that we take as 
our servicing rule the invasion percolation rule - i.e., the rule that prioritizes customers (points of 
Q) with lower y-coordinate over those with higher y-coordinate - and call the resulting tree T\. 
Then T\ is precisely the subtree of TPC(V , N) containing the root and all nodes joined to the root 
by paths all of whose edges have weight less than A. Of course, everything still holds if we take 
Q = V n ([0, oo) x [0, A]) - i.e., if we include points at height precisely A - as long as we replace 
S°> x ~ by S°' X and replace the phrase "less than A" by "at most A" . 

As a consequence of the above discussion we have the following important fact. 

Lemma 26. Fix any integer n > 1, any A > and let P be a set ofn—1 independent uniform 
points in [0, n] x [0, A). Given that F(0, n, A, P) occurs, the tree IPC(P, {0, . . . , n — 1}) is distributed 
as PGW(A) conditioned to have n nodes. Furthermore, suppose thatp is a uniformly random point 
on the line segment [0,n] x {A}. Then under the same conditioning, IPC(P U {p}, {0, . . . , n}) is 
distributed as PGW(A) conditioned to have n nodes, together with an additional vertex (vertex n), 
joined to a uniformly random element o/O, .. . ,n — 1. 

We remark that for any fixed n, the distribution of PGW(A) conditioned to have n nodes does 
not depend on A, and is precisely that of a uniformly random labelled rooted tree (or Cayley tree) 
on n nodes, after the labels but not the orders of children have been discarded. It turns out that a 
version of Lemma 26 also holds for the box tree; see Lemma 37, below. As noted just after Lemma 
24, the probability that F(0,n, A, P) occurs is precisely 1/n. Thus, the distribution of |PGW(A)| 
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is given by 

1 e~ Xn (Xn\ n ~ l 

P {I PGW(A)| =n} = -P {Poisson(An) = n - 1} = ^ , 

n n\ 

for all positive integers n (a well-known fact which we record for later reference). When A = 1 this 
is called the Borel distribution. 

We briefly explain a further basic fact about the function 8(A) — P {| PGW(A)| = oo} and about 
Poisson Galton- Watson duality. By considering the number of children in the first generation of 
PGW(A), we see that 1 — 9(A) = e~ X9 ^ x \ and by differentiating this identity, we see that 

0'(A)(1 -A(l- 0(A))) = 0(A)(1- 0(A)). (1) 

an equation we will have use of later. Next, given A > 1, let m = m(A) < 1 be such that 
Ae~ A = me~ m (we call m the dual parameter for A). Then m = A(l — 0(A)), from which it is easily 
seen that conditional on being finite, PGW(A) is distributed precisely as PGW(m). 



3.3 IPC('P, N) and the forward maximal process 

By Corollary 22, IPC(V, N) consists of a unique infinite backbone which in particular passes through 
the nodes {«i}^ ' an d from all nodes of which emerge finite branches. Let the edges of the 
backbone be e%, e 2 , . . ., and for each integer i > 1 let Wi = sup J>i W ej , so {Wi}^! is the PWIT 
forward maximal process. From the perspective of the PWIT, the nodes rii are the nodes at which 
the forward maximal weight along the backbone decreases. 

Lemma 26 allows us to provide another picture of the structure of IPC(7 : ',N). First, for each 
integer i > 0, let Tj = Ti(V, N) be the subtree of IPC(7-', N) on nodes m,. . . , n i+ i — 1 (these nodes 
induce a tree by Lemma 10). The set P = V n ([n i} n i+ i) x [0,h n . +1 )) is distributed as (n i+ i — 
m — 1) independent uniform points, conditional on F(ni, rij+i, h n , P) occurring. Furthermore, 
Pfl ([fii, rij+i) x {h ni+1 }) contains a single uniform point. Thus, by Lemma 26, we obtain that Ti 
is distributed as PGW(/i nj+1 ) conditioned to have ni+i — n% nodes, and that n^+i is joined to a 
uniformly random clement of Ti. Applying this for all i, we obtain the following theorem. 

Theorem 27. Given {ni}f^ , IPC(V,N), viewed as an unlabelled tree, can be built as follows. For 
each integer i > let Ti be a uniformly random labelled tree on rii — 7ij_i vertices. For each integer 
i > I; join the root of Ti to a uniformly random vertex ofTi—i. Finally, discard all labels. 



Remark. It also follows straightforwardly from Lemma 26 that given {ni}°l and {h ni+1 }iZo, 
IPC^jN) viewed as a weighted unlabelled tree can be built from the tree described in Theorem 
27 as follows. Independently for each integer i > and each edge e of Tj, assign e a random weight 
with Uniform[0, h ni+1 ] distribution. Also, for each integer i > 0, give the unique edge from Tj+i to 
Ti the weight h n . +1 . We omit the details. 

We next show that {(|T,-_i|, h ni )}^l 1 = {(rii — n i-i, h n .)}°*L 1 is a Markov process and specify 
the transition probabilities. First, for any i > 1, given h ni , the set Q = V n ([n,;,oo) x [0, h ni )) is 

precisely a Poisson point process of intensity 1 conditioned on the event that S " "< (Q) survives 
(which is precisely the event that Q is {n,, nj+i, . . . , }-reasonable. Furthermore, given h n ., the 
condition G(ni, h~. ) holds for V . Thus, by Lemma 15, we can determine the structure of IPC^, N) 
restricted to {n^, . . . , oo} by considering only the points in Q. It follows that {(n.; — rij_i, h ni )}°^ =1 
is a Markov process, as claimed (and also that {h ni }°^L 1 is a Markov process). Next, for 1 < y < h, 
let 

t r \ r P {^n i+ i € dy | h ni =h} 

h(y) = Jim , 

dy^o dy 

and for n > let 

, , s ,. P {h ni+1 € dy, (n i+ i - m) = n \ h ni = h\ 

fh(n,y) = lim — 

dy^o dy 
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so f h (y) = Y, n fh(n, y). By the above comments, f h (y) and f h (n, y) do not depend on i. 
Lemma 28. For all i > 1 and 1 < y < h, 

A0/) = ^y ««* fh ^ = m („_!)! 

Combining the two results in Lemma 28, the following corollary is immediate. 
Corollary 29. For all integers n, i > 1 and « > 1, 

P {n i+1 - = n | fe Wi+1 = y } = (n-l)\ ( ) 

Proof of Lemma 28. We have 

P {K l+1 <V \h ni =h} = P {s n " y (V) survives S n *' h ~ (V) survives } 

p{S n i>y(V) survives} 
~ P {S n ^ h ~(V) survives} 

= %) 
9{hY 

and the first claim of the lemma follows by differentiation. 

As mentioned, fh(y) and fh(n, y) do not depend on i so we take i — (and thus rii — 0). In order 
to have nj.— no = n and h ni £ dy, we need that | [0, n] x [0, y)\-p = n— 1, that | [0, n] x [y, y+dy)\-p = 1, 
that F(0,n,y,V) occurs, and that S n,v (V) survives. The probabilities of the first two events are 
easily bounded. The probability of F(0,n, y,!^) given that |[0, n] X [0, y)\-p = n — 1 is 1/n by the 
cycle lemma. Finally, the event that S n,v (P) survives is independent of the first three events, 
and has probability 9(y). Thus, 

e~y n (yn) n ~ 1 1 1 

P {h ni+1 G dy, (n i+1 - n l ) = n | h ni = h } = — — — • (1 + o(dy))ndy ■ - ■ 6{y) ■ -^-r, 

' [n — 1)1 n 9(h) 

from which the second claim of the lemma follows. □ 



We next derive the distribution of the distance along the backbone between rij_i and n,. 
For i > let d.; = di(V,T) = dipc(^i! f^i+ij- As with the quantities studies above, we have 

that given h ni+1 , d, is independent of the past. For < x < 1 we say X = Geometric^) if 
P{X = k} = x k (l-x). 

Theorem 30. For all i > and all y > I, given that h ni+1 — y, d, = 1 + Geometric(m(n)). 



The following theorem derives the distributions of the trees hanging off the backbone and within 
a given pond. 

Theorem 31. Fix i > and y > 1. Given that h 1li+1 = y, for all k with n.j < k < n i+1 and for 
which k is on the backbone, the subtree of IPC(V,N) containing k and containing no other vertices 
of the backbone, is distributed as PGW(m(y)). 



Together, Theorems 30 and 31 provide another Markovian characterization of IPC(7 : ',N): we 
may construct a tree with the distribution of IPC(7 : ',N) by growing the trees hanging off the 
backbone one- by-one, where the branching distribution of the trees depends on the current forward 
maximal weight. (This characterization is exactly that which is claimed in Theorem 3, which 
proved below.) The forward maximal weight process evolves according to the dynamics implied 
by Lemma 28 and Theorem 30: first stay constant for a geometric amount of time depending on 
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the current forward maximal weight, then decrease the maximal weight according to Lemma 28. 
This characterization is essentially the a — > oo limit of results of Angel et al. [2] described in the 
introduction, for invasion percolation on the regular cr-ary tree. However, it does not seem trivial 
to derive these results from theirs by a limiting argument and local weak convergence, since they 
depend not only on the graph structure of the tree but also on the weights. 

If one wishes, at this point one can apply all the methodology of [2] to see that corresponding 
results hold for invasion percolation on the PWIT: notably, convergence to the Poisson lower 
envelope, mutual singularity of IPC and IIC measures, and spectral asymptotics all hold for invasion 
percolation on the PWIT. We have not included the details as the development is essentially 
technical, requiring no significant ideas not already found in [2]. 

Since m(A) = A(l — 0(A)), we also obtain the following corollary of Theorems 27, 30, and 31 
and Corollary 29, which is new as far as we know. For any < p < 1 let D — Geometric (p) and 
let r = Vq, Vi, . . . , Vd — v be a path of length D. For each k = 0, . . . , D, starting from Vk grow a 
PGW(p) tree with root Vk- This yields a triple (T,r,v) p . 

Corollary 32. Fix any < p < 1 and let (T, r, v) p have the distribution described above. Then 
conditional on \T\, T is distributed as PGW(p) conditioned to have size \T\, and v is distributed 
as a uniformly random node of T . Furthermore, let y — m _1 (p). Then for all n > 1, P{|T| = n} 
is given by the right-hand side of (2). 



We now turn to the proof of Theorem 30. Recall that for y > 1, m(y) is the dual parameter 
for y. We will make use of the following easy (and known) lemma. 

Lemma 33. Let T be a Cayley tree of order n with root r, and let v be a uniformly random node 
in T. Then 

P{d T (r, V ) = k-l}= k -l0 
where (n)& = n(n — l)...(n — k + 1). 



Proof. We view T as a doubly-rooted tree with roots r and v. By removing the edges on the path 
from r to v, we obtain a forest of dx{r, v) rooted trees, whose roots are ordered (as r — v\, . . . ,Vk = 
v, say). The number of such forests is i^)k\ ■ (kn n ~ k ~ 1 ) (see, for example, [18], Theorem 3.2). The 
result follows by dividing by the total number of doubly-rooted trees on n labelled vertices, which 
is n n by Cay ley's formula. □ 



Proof of Theorem 30. As mentioned at the start of Section 3.3, for all i, n i+ i is joined to a uniformly 
random element of Ti - say Vi - and dipc(™i, ^i+i) = 1 + dr^ni, Vi). By Corollary 29 and Lemma 
33, it follows that 

P {dipc(n t ,n i+ i) = k \h ni+1 =y} 

OO 

= ^P{d Tl (n I ,v i ) = fc-l | \Ti\ =n}P{\Ti\ =n \ h ni+1 = y) 

n—k 

= ^Hn)k 9(y) e-^m/)"- 1 
=k nk+1 ' : ) ! 

= fc -i %) k e^y{n V y- k 

y 9>{y)^ k n (n-k)\ 
^^Sf; ^ {Poisson(ny) =„-*}. 

W n=k 



By the cycle lemma 
k 



P {Poisson(ny) = n - k} = P (s°' v = -k, S°' y > -k V < i < n} , 
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so 

P {diP C (n,,n J+1 ) = k | h nt+1 = y} = V^^f { S n V = -* for some n) . 

But by the connection with queueing theory explained above, P {S®' v — —k for some n} is precisely 
the probability that k independent PGW(y) all fail to survive, which is (1 — 6(y)) k . We complete 
the proof by applying the identity (1) from page 16. □ 

Proof of Theorem 31. For this theorem we revert to viewing IPC^, N) as a subtree of the PWIT T; 
our proof is based on the proof of Proposition 2.3 of [2]. Given a node v £ T, write T v (resp. %(X)) 
for the subtree of T rooted at v (resp. rooted at v and containing all edges of weight at most A to 

descendants of v) - so %(X) = PGW(A) - and write X*(v) = inf{A : %(X) is infinite}. 

Let r be the root of T, and fix any node v £ T. Fix y > 1 and integers 1 < j < k. Let E v .j,k.dy 
be the event that v is on the backbone, has k children of whom the backbone passes through the 
j'th (in the left-to-right ordering of the PWIT), and X*(v) £ dy. We split E v j t ).,dy into four events 
depending on distinct edge sets of the PWIT. 

Fx T r {y) — %(y) is finite. (This event depends only on edges of T — %.) 

F 2 v has precisely k children of weight at most y + dy - say v±, . . . (This depends only on 
the weights of edges from v to its children.) 

F 3 For i £ {1, . . . ,k}\ {j}, T v . (y + dy) is finite. (This depends only on the weights edges in the 
subtrees %. for i ^ j) 

F4 T v . (y) is finite but T v . (y + dy) is infinite. (This depends only on the weights of edges in T v . .) 

Since the edge sets determining the events F\, ■ ■ ■ , F4, are disjoint, if E occurs then the conditioning 
on subtrees T v . (y) for i £ {1, . . . , k}\{j} is precisely that they are finite. Thus, given that E occurs, 
for each i £ {1, . . . ,k} \ {j}, T Vi (y) is distributed as PGW(m(y)). 

Now let E v ^y = ^i.jE Vt ij t dy ~ so E v .d v is the event that v is on the backbone and that 
X*(v) £ dy. Given the observation at the end of the previous paragraph, to prove the theorem it 
suffices to show that as dy — > 0, given E Vt dy, the number N v (y + dy) of children of v in T v (y + dy) 
approaches Poisson(m(?/)) + 1 in distribution (so that the number of children off the backbone 
approaches Poisson(m)). To see this is an easy calculation. First, 

P {E v<dy } = P {Fx} ■ P {T v (y) is finite but T v (y + dy) is infinite} 
= P{F 1 }-(l + o(dy))0 / (y)dy. 

Next, fixing k > 1, by symmetry, 

P {N v (y + dy) = k, E Vtdy } = (1 + o(dy))P {F^ ■ P {Poisson(y) = k} ■ k ■ (1 - ^(y))^ 1 ■ 0'(y)dy. 

The factor k above selects which of the k children of v is on the backbone. Since m = y(l — 0(y)), 
it follows that 

lim P{N v (y + dy) = k \ E v dy } = P {Poisson(y) = fc}-fc-(l-6»(y)) fe " 1 = P {Poisson(m) = k - 1} , 

dy^O 

which completes the proof. □ 

4 The stationary graph and box processes 

Throughout this section, V denotes a Poisson point process of intensity 1 in the upper half- plane, 
so V is almost surely exemplary and Z-distinguished. 
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4.1 Rooted subtrees in the box tree 

Recall that BG = BG(7 3 ,Z) is the tree with vertex set Z defined in Section 2. For n € Z, we let 
BG™ denote the subtree of BG rooted at n, and write | BG™ | for its size (number of vertices). By 
the definition of BG, it is immediate that | BG™ | = n — £ n . Our main aim in this section is to 
prove the following theorem. 

Theorem 34. BG(7 : ',Z) is distributed as the Poisson LLC, in the local weak sense. 

A key step in proving Theorem 34, one that additionally introduces several of the main ideas, 
is the following theorem. 

Theorem 35. For all n E Z, conditional on ^ n ("P,Z) and h n (V,Z), BG"('P,Z) is distributed as 
PGW(/i") conditioned to have n — £ n nodes. Furthermore, unconditionally BG™ is distributed as 
PGW(l). 

Corollary 36 ([17], Theorem 1). We have [x(p )\ = -[AV\, wh ere A is Borel distributed, and 
V is Uniform[0, 1] and independent of A. 

Proof. Given £q and ho, the line segment [^0;0] x {^-o} contains a single uniformly random point, 
and this point is Pq. The second assertion of the theorem implies that £ is Borel distributed, and 
the corollary follows. □ 

For the next several pages, we focus on developing the tools needed for the proof of Theorem 35. 
By translation invariance, it suffices to prove Theorem 35 with n = 0. We prove the theorem by 
way of the following analogue of Lemma 26 that holds for the box tree. 

Lemma 37. Let n, X, P and p be as in Lemma 26. Given that F(0,n, A, P) occurs, BG(P U 
{p}, {0, . . . ,n}) is distributed as PGW(A) conditioned to have n nodes. 

We remark that for any A > 0, PGW(A) conditioned to have k nodes and PGW(l) conditioned 
to have k nodes are identically distributed. Thus, in proving Theorem 35 and Lemma 37 we may 
and shall at times assume without loss of generality that A = 1. Figure 4 contains an example 
of BG(P U {p}, {0, . . . , n}) for P,p as in Lemma 37, with n — 256. (By the preceding comment, 
the value of A is not important.) In proving the lemma, it will be important to view PGW(l) 




Figure 4: The boxes for a random set of 256 points. The points themselves are omitted. The code 
for generating this image was written by Omer Angel. 

both as an ordered (plane) tree and as an unordered tree. The ordered perspective is natural for 
PGW(l) when viewed as a subtree of the PWIT. Next, fix an unordered, rooted tree U. We will 
abuse notation by writing PGW(l) — U if PGW(l) = T for some ordered tree T with underlying 
unordered tree U . Fix one such tree T, and let aut(fJ) be the number of rooted automorphisms of 
T. (Note: by this we mean the number of distinct plane trees with underlying unrooted tree U. 
For example, for the tree U in Figure 5, interchanging the pair of leaves x and y does not affect 
the plane tree, and aut(U) = 12.) We then have P {PGW(l) = U} = aut(T) ■ P {PGW(l) = T}. 

We next turn our attention to BG°. There is again a natural ordering of children in BG° 
- vertices are integers, and when we refer to a box tree as an ordered tree, we are referring 
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X 



il 



Figure 5: Viewed as an unordered tree, the above tree has aut(t/) = 12. 



to the ordering inherited from the integers. However, unlike in PGW(l), we cannot expect the 
distributions of distinct subtrees to be identical under this ordering. Given an unordered tree U, 
we will also abuse notation by writing P {BG° = U] if BG° is unlabclled, rooted isomorphic to U. 

Our proof of Lemma 37 makes use of the following easy fact. 

Lemma 38. Let r > 2 and let s\, . . . s r be natural numbers. Then 

7T j = 2 Z^li=l a 7T(j) 

where the summation is over all permutations n of {1, . . . , r}. 



Proof. We proceed by induction on r. For r — 2 the sum is over just two permutations and the 
result is 

-J5- + -^_ = l, 



S 2 + Si Si + s 2 

as required. For general r, we partition the set of permutations it of {1, . . . , r} depending on the 
value of 7r(r) - for each k — 1, . . . , r let life be the set of permutations n of {1, . . . , r} with 7r(r) = k. 
Since our aim is to prove that the sum 

ir j=2 l^i=X *f W k=l 7rGn fc j=2 l^i=l & 7r(i) 

has the value one, it suffices to prove that for each k we have 

r 

Err s 7r(j) _ s k 
11 o _ V r s ■ ' 

Since the expression on the right-hand side here is the j = r term of the product for all it € life, it 
suffices to show that 

ireH k j=2 2-ii=\ 

By re-labelling if necessary, this may be deduced from the induction hypothesis. □ 



Proof of Lemma 37. Fix an unordered tree U with n vertices and root r. We will show that 

n-P{BG(P,[n]) = 17} = P{PGW(1) = U \ |PGW(1)| = n} . (3) 

Proving this equality will prove the lemma, since F(0, n, A, P) must occur in order to have BG(P, [n]) 
U, and P {F(0, n, A, P)} = 1/n as noted just after Lemma 24. The case n = 1 of (3) is trivial, so 
suppose that n > 1 and that the proposition holds for all n' with 1 < n' < n. 

Order the children of the root r of U arbitrarily, and suppose that the subtrees U\ , . . . , of 
U rooted at the children of the root have sizes n\, . . . , with respect to this order. Let aut(r) be 
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the number of permutations of the children of r which induce automorphisms of U. (For example, 
the tree in Figure 5 has aut(r) = 2.) 

We note that 

P {PGW(l) = U | | PGW(1)| = n} 
_ P{PGW(1) = U} 
~P{|PGW1| = n} 

=^-^^n p{PGW(1)=I '' ) 

V ' 2—1 

= ^ n nJ ^ p < pgw <d = "> 1 1 «™m = "■ > 

=^n( )n(-)" l -P{PGW(l)^C/ l ||PGW(l)=n i }. (4) 

Next, assume the points of P are listed in increasing order of height as {p\, . . . ,p n }. We first 
consider the sizes of the subtrees of Bq. 

Let tuq = and for i = 1, . . . , k, let rrii = rrii-i + (so in particular nik = n — 1). Also for 
i = 1, . . . , k, let Qi be the set of points p G P \ {p n } satisfying m t _i < x(jp) < rrii, and let p l be 
the point of Qi with greatest y-coordinate. 

We recall the definitions of the events E, F, and G from Lemma 14. In order for n to have 
children with subtrees U%, . . . ,Uk in that order, it is necessary and sufficient that the following 
events occur. 

(I) E(rrii-i, n%i, yip 1 ), P) occurs for each i = 1, . . . , k. 
(II) We have y(p l ) > y(p 2 ) > . .. > y(p k ). 

(III) F(mi_i,m,i,y(p' 1 ), P) and G(m,i-i,y(p' t ), P) occur for each i = 1, . . . , k. 

(IV) BG'™ 1 = Ui for each i=l,...,k. 



First, (I) is equivalent to the requirement that \Qi\ = rii for each i = 1, . . . , k. The x-coordinates 
of points in P, are uniformly distributed on [— n, 0], so 

2—1 

Given (I), for (II) to occur it suffices that for each i = 1, . . . , fc, the point of (P\ {p n }) \ (Qi U . . . U 
Qi-i) with the largest y-coordinate, is a member of Q 2 . Thus, 

k 

P{(n) I (i) } = II : "U-i ■ (8) 

Since y(p*) < 2/(p"0 for j < i, if (I), (II), and Hi^i 1 P( m i~i, m i, y(p 1 )) an hold for some k > 1 then 
it is immediate that G(wi_i, y{p 1 )) occurs for each i = 1, . . . , k. Thus, 

P{(III) \(l),(ll)}=p\f]F( m ^ 1 ,m l ,y*( l )) |(I),(II)) 
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Furthermore, given (I) and (II), independently for each i — 1, . . . ,k, the points of Qi \ {p 1 } are 
independently and uniformly distributed in [mj_i,mj) x [0,y(p 1 )). Thus, by the cycle lemma, 

k 

P{(III) \(l) 7 (ll)} = l[P{F( mi . um . l ,y^)) |(I),(II)} 

i=i 

k 1 



2—1 



Finally, given (I), (II), and (III), and independently for each i = 1, . . . , k, Qi\ {p 1 } is precisely a 
uniform set of rii — 1 points, conditioned on F(m,i-i, rrii, y(p l ), Qi) holding, and p l is a uniform point 
on [rtii—rii, x |?/(p 4 )}. Furthermore, by Lemma 15, given E(rrii-i, m*, y(p 1 )), F(rrii-i, rrii, y{p 1 )), 
and G(mi^i,y(p 1 )), we have 

BG m *(P, {0, . . . , »}) = BG(Qi, . . . , mj). 

Thus, by the induction hypothesis, 

P{(IV) | (I), (II), (III)} 

k 

= H P {BG m ' (P, {0, . . . , »}) = Ui | S(mi_i, m„ 2/(p 1 )), F(m,_i, m„ 2/(p 2 )), y(p 1 )) } 

i=i 
k 



= II P { BG (^' { m i-ii • • • > TO *I) = ^ | F(m,i-i,mi, y{p 1 )) } 

i=l 
k 

= JJP{PGW(1) = Ui | |PGW(1)| =m}. (8) 



i=l 



Combining (5)-(8), and rearranging, we obtain that 

P {n has children Ui, . . . , in that order in BG(P Up, {0, ... , n}} 

n ~ l )li — ; Vi n(-) W, - p < PGW ( 1 ) = ^ l|PGW(i)i = m>, 



aut(r) -p-r n 



n 



II 1 ' i-i P {PGW(l) = U | | PGW(1)| = n} . 



the latter equality holding due to (4). To obtain P {BG(P U p, {0, . . . , n}) = U}, we now must sum 
this bound over distinct orderings of U\ , . . . , We instead sum over all permutations n : [k] — > [fc] , 
and note that this counts each distinct ordering aut(r) times. We thus obtain 



P{BG(PUp, {0,...,n})=C/} 



-P {PGW(l) = ?7 | | PGW(1)| = n} • ^ J| 



*7r(i) 



,r:[k]-»[k] tl" -1- 2^3 = 1 n ^U) 

By Lemma 38, the above sum is 1, which establishes (3) by induction and so completes the proof. 

□ 

In proving Theorem 35, we will use the following identity, which we quote in advance. 

Lemma 39. For integers a > 0, b > 0, let I a j, := Jq x a e~ bx dx. Then Ib-i.b — Ib.b = e~ b /b for 
each b > 0. 

Proof. Integration by parts. □ 
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The final step before proving Theorem 35 is to derive the the conditional distribution of £q — 
| BG° | given h$. As this will be useful later in the paper, we state it as a separate lemma. Write 

/ \ y p {N =n,ho G [y,y + dy)} 
ip y {n) = hm . 

Lemma 40. For all < y < 1 and all n > 1, 

Vy{n) = {l-y)- ■ (9) 

Proof. Fix n € N and < y < 1. In order to have £q = —n and ho — y, it is necessary and sufficient 
that E = E(~n,0,y : P),F = F(-n,0,y,P), and G = G(-n,y,P), from Lemma 14 all occur. We 
first calculate the density of the event E. 

P{|[-n,0) x [0,y)\ v =n-l,\[-n,0) x [y,y + dy)\ v = 1} 
=(1 + o(dy))P {Poisson(rty) = n — 1} • ndy 

=(l + o(dy)) , Kny > -ndy. 

{n-iy. 

Now let f E (y) = / B (0,n,y) = — ■ n. Given that |[-n,Q) x [0,y)|-p = n - 1 occurs, 

n, 0] x [0,y)) consists of n — 1 uniformly random points. Independently of this, given that 
|[— n, 0) x {y}|-p = 1, the line segment [— n,0] x {y} contains a single uniformly random point. By 
the first of the two preceding observations and by the cycle lemma, it follows that P {F \ E} = —. 
Furthermore, G is independent of E, and so by Lemma 24, P {G} = 1 — y. We thus have 

<p y (n)=P{F,G \E}f E (y) = P{G}P{F \ E} f E (y), 

from which the lemma follows. □ 



Proof of Theorem 35. We assume without loss of generality that n = 0. Let P = ([£ , 0] x [0, ho]) fl 
V, and let p = ([£o, 0] x {ft })n?. Then P is precisely distributed as a set of \£q\ — 1 uniform points 
in ([^0)0] x [0, h ]), conditional on F(£ o ,0,h ), and p has uniform distribution on ([^o>0] x {h Q }). 
The first claim of the theorem then follows from Lemma 37. Next, by Lemma 40, for any positive 
integer m we have 



P{\l \ = m}= f 
Jo 



(p y (m)dy 
ui m ~^ / \ 

7 7TT ( Irn—l.m Im.m J 5 (^) 

(m — 1)! V / 

where the notation I a ^ is that defined in Lemma 39. Applying that lemma, we obtain that 
P { | BG° | = to} is m m ~ 1 e~ m /m\ (exactly the probability that a PGW(l) has size to). The second 
claim of the theorem then follows from the first and the fact that the conditional distribution of 
PGW(A) given its size, is independent of A. □ 

Before proving Theorem 34, we first state a consequence of the above development. 

Corollary 41. For any positive integer n, any < y < I, and any unordered rooted tree U with 
\U\ = n, 

P |BG° = U, h £ dy) , 
lim — != — ^ ^- = (1 - y)nP {PGW(y) = U} . 

dy^O dy 

Proof. Immediate from (9) and Theorem 35. □ 
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Figure 6: In the above figure, an ordering u = (1/1,1/2,1/3) of the children of U* is fixed. Dashed 
edges lead from ao to its three children. In this example, j(U* ,U') = 2 since U\ and U% are 
(unordered, rooted) isomorphic to U' but U% is not, and «i(u) = 1, «2( u ) = 3 for the same reason. 
Finally, this example relates to the event E UtP with p = 1, since ao = 771^(11). 



Proof of Theorem 34- We will in fact prove that for any unordered rooted tree U' , conditional on 
BG° = U', BG°° \ BG° is distributed as PGW(l), which implies the statement of the theorem. 
Thus, let U and U' be unordered rooted trees with roots r and r' , and let U* be the unordered 
rooted tree with root r obtained by adding an edge between r and r'. We define 

E = {BG° = U', BG a ° \ BG° = U] 
= {BG° = U',BG aa = U*}. 

Next let k — k(U*) be the number of children of r in U*, and let j — j(U* , U') > 1 be the number 
of children of r in U* whose subtree is isomorphic to U' . Also, let aut(r) (resp. aut*(r)) be the 
number of permutations of the children of r in U (resp. U*) which induce automorphims of U 
(resp. U*). Note that aut*(r) = j ■ aut(r). 

Given an ordering u = (Ui, . . . ,Uk) of the children of r in U* , let i\ — ii(u), . . . ,ij — ij(u) 
be the set of indices i for which Ui is isomorphic to U' . For each £ = 0, . . . , fc, let mt = m^(u) = 
1 + Sg=^+i I Uq I (which is 1 when £ = k). Let 

i? u = E 1 n {ao has children U\, . . . ,Uk in that order}, 

and for each p = 1, . . . , j, let 



E u ,p — E u n {ao = Wi p (u)} = {mi p (u) has children U\, . . . , Uk in that order}. 

(The above definitions are depicted in Figure 6.) Then E = UuUp=i ^n,p, where the first union 
is over all distinct orderings u (we say u = (Ui, . . . , Uk) and u' = (U[, . . . , U' k ) are distinct if there 
is some i € {1, . . . ,k} for which Ui and U[ are not isomorphic). The double union is then over 
disjoint terms, and so 

P{£} = ^^P{£ U , P }. 

U p— 1 

Reversing the order of summation above, by translation invariance and Theorem 35, we obtain 

p—l U 

= j-P{BG° = £T} 

= j-P{PGW(l) = U*}. 
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Now fix some ordering U\, . . . , Uk of the children of r in U* with Uk = U'. Then by the definition 
of PGW(l) and the preceding equality, we have 

P -i u JL 

V ' 2—1 

= P { PGW(1) = U'}- e :\^-$ II P {PGW(l) = U } 

= P {PGW(l) = £7'} P {PGW(l) = U} . 
It follows by Theorem 35 that 

P |BG ao \ BG° = U I BG° = U'}= r P ^ r 

1 x 1 ; P{BG° = ?7'} 

= P{PGW(1) = U}, 

proving the theorem. □ 



4.2 An ancestral process in IPC(V, Z) 



By the end of this section we will have proved Theorems 1 through 3 from the introduction. To 
warm up, we prove the following theorem. 

Theorem 42. The subtree of IPC(V,1*) rooted at zero and containing only nodes with positive 
label, is distributed as PGW(l). 



Proof. By Proposition 18, each point of V in [0, 1] x [0, 1] yields a child of in To, so has Poisson(l) 
children. Let ro = and let r\ = min(i > : [cc (j>, ) J = 0), so t\ is "the first integer with 
as a parent". For any i > 0, the event that r\ = i is independent of V n ([i,oo) x [0, oo)), so r\ 
also has Poisson(l) children. More generally, let = min(i > r^-x ■ [x(pi)\ € {r , . . . , r^-i}). 
Then for all fc > 1 and all i > 0, the event that r^ — i is independent of V n ([i, oo) x [0, oo)), 
so r£, has Poisson(l) children. The nodes r , r\, . . . are precisely the descendants of 0, and we 
have just seen that each has Poisson(l) children independently of all the others. This proves the 
theorem. □ 



Heuristically, the fact that IPC(7-',Z) is equal in distribution to the IIC can be seen as follows. 
By symmetry, from Theorem 42, at each node of IPC(7 : ',Z) is rooted a copy of PGW(l). Also, 
by exploring the nodes of multiple trees in a left-to-right fashion as in Theorem 42, we see that 
the offspring distribution for distinct branches of IPC(7 : ',Z) are independent. Furthermore, the 
parent of in IPC^, Z) is more likely to be a node with many children than one with few children. 
This should "size-bias" the number of children of the parent of zero, in such a way as to precisely 
compensate for the edge from to its parent, so that a Poisson(l) number of children remain. 
The same argument should also hold for the parent of the parent of zero, and so on ad infinitum. 
It is possible to make (parts of) this heuristic argument rigorous; however, we obtain the result 
as a relatively direct byproduct of our argument for Theorem 3, whose proof requires a different 
approach. 

The key to the proof is the definition of a "backward maximum process" which is extremely 
similar to the forward maximal process. We begin by listing and its ancestors in IPC(7 : ',Z) in 
decreasing order as riQ,n\,n2, et cetera, so in particular uq = and in general n$+i = \x{p rLi )\. 
For i > let Wi — h ni , and let mi = maxo<j<i w 3 ■, the greatest weight of any of the first (i + 1) 
edges. In particular, too = wq = Hq. Finally, let io = and, for fc > 1, let ik be the smallest integer 
* > Zfc-i for which m ik > m ik l . Then the following lemma is basic (but important). 

Lemma 43. For all fc > 1, ni k — l(ni k _ x ), and so rrii k — h n . . 
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Proof. The fact that rii k < £(rii k _ 1 ) is immediate from Lemma 9. But I = £(ni k l ) is an ancestor 
of ni k _ 1 by Lemma 10, and hi > h nik i by Lemma 8. Thus, £(ni k l ) = rii k as claimed. □ 

Above, we derived the joint distribution of the height and length of Bq. We next show that the 
sequence {mi k }ken has a particularly simple and pleasing description. 

Lemma 44. The sequence {mi k }keN is a homogeneous Markov chain, and for all k, given m,i k , 
nii k+1 has distribution Uniform^i,. , 1] . 

Proof. For all < y < 1 and all k, hk < y if and only if the random walk L k,y has a chance. As 
remarked in Section 3.2, the probability of this is precisely y. Given mi k _ 1 and rii k , by Lemmas 8 
and 43, we know precisely that m ik — h ni > h ni i = m ik l . In other words, we know precisely 
that the random walk L rilfc ' m * fc - 1 has no chance. Thus, for < to < y < 1, 

P {m ik < y | m lk _ 1 = m } = P jfc^ < y | m ik _ 1 = ml 

= P {L ni fc ,y has a chance | L™^'™ 1 has no chance} 

1 P {L ni k' y has no chance} — P {L ni k> m has a chance} 
P {L ni k ' m has no chance} 

y — m 
1 — to' 

which proves the lemma. □ 

Note also, by the first remark in the proof of the lemma, we have the following proposition. 
Proposition 45 ([17], Theorem 2). ho = Uniform[0, 1]. 

We now prove a more substantial result, about the structure of the portion of IPC('P,Z) that 
lives "under the backward maximum process". It essentially states that, like the forward maximal 
process, the portion of IPC(V, Z) that lives under the backward maximum process looks like a single 
infinite backbone, to which subcritical Poisson Galton- Watson trees are attached at each point. 
Also, these subcritical trees become closer and closer to critical the further along the backbone 
from they are. 

Theorem 46. Let IPC~('P,Z) denote the restriction o/IPC('P,Z) to the non-positive integers, 
and let be its root. Then IPC _ (7 3 ,Z) is distributed as Tjjc- 

We will prove this theorem at the end of the section. For each k > 0, let Vk — V H ([rii k+1 , rii k ] x 
[0,TOjJ) and let I k = {n lk+1 , n lk+1 + 1, . . .,n lk }. By Lemma 26, given (n lk+1 -n ik ), IPC(P k ,T k ) is 
distributed as PGW(m,J conditioned to have (jii k+1 —rii k ) nodes, together with a single additional 
node (the node rii k ) attached to a uniform vertex. 

Theorem 47. For all k > and < to < 1, given that mi k = m, IPC('Pfc,Ifc) is distributed as a 
path with (l + Geometric(?7j)) edges, from rii k+1 to rii k , with an independent PGW(m) tree attached 
to each node of the path except rii k . 

Proof. The proof uses a correspondence between IPC(Vk,T k ) and a pond of IPC(7 : ',N) of appro- 
priate height. Let A > 1 be such that me"" 1 = Ae~ A , so then m = A(l — 0(A)). For all n > 1, by 
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Lemma 40 wc have 



p ("ifc+i - = " I = m) = (1 - m) 

1 — TO 



(n — 


1)! 


n n ~ 


-l 


(n — 


1)! 


n n - 


-l 



(me"" 1 )" 

x l-A(l-fl(A)) 
( j A(1-*(A)) 

/\„— A\n <?(A) 



(n-1)!' ' A0'(A) 
_ 6>(A) (An)""^-^ 
~ 6"(A) (n-1)! ' 

By Corollary 29, the latter is the probability that a pond of IPC^, N), conditioned to have height 
A, has size n. Thus, IPG^k^k) is distributed as a pond of IPC('P,N) conditioned to have height 
A. By Theorem 30, it follows that the length of the path from rii k+1 to rii k has distribution 

1 + Geometric(A(l — 0(A))) = 1 + Geometric(m). Furthermore, by Theorem 31, to each vertex of 
the path except n^ is attached an independent copy of PGW(to). This completes the proof. □ 



Having proved Theorem 47, we are now prepared for the last ingredient needed for the proofs 
of Theorems 1 and 46. 

Theorem 48. IPC('P,Z) is distributed as the Poisson IIC, in the local weak sense. 



Proof of Theorem 48. For each j > 0, let Tj be the subtree of IPC('P, Z) rooted at rij and containing 
all nodes reachable from rij without passing through nj—i or rij + i (so Tj contains neither rij—i 
nor nj + i). We show that independently for each j, Tj is distributed as PGW(l), which proves the 
theorem. 

For each j, let k = k(j) be the largest integer k for which ik < j. Let Uj be the subtree of Tj 
containing only nodes of index less than (in other words, Uj is the subtree of Tj which lives 
under the backward maximum process). Also, let Vj be the subtree containing rij and all nodes 
of Tj not in Uj. Then by Theorem 47, the Uj is distributed as PGW(TO, fc(;f) ), independently of 

Next, for each node £ G Uj, the number of children of £ in IPC(7 : ',Z) that are not in Uj is 
precisely the number of points of V in [£ — 1, £) x (m ikl .. ) ,1), and therefore has Poisson(l — m ik(j) ) 
distribution. Furthermore, all such children have strictly positive index by the definition of Uj. 

Finally, as in Theorem 42, let r\ — mm(i > : L^fe)] *= Uj). Then r\ has a Poisson(l) 
number of children, independently of Uj. More generally, exposing the descendants of Uj in a 
left-to-right fashion as in Theorem 42, we see that each descendant of Uj with positive index has 
a Poisson(l) number of children, independently of all the others. 

To sum up: Uj is distributed as PGW(mj fe ..,); each node of Uj independently has Poisson(l — 
mi k{j) ) children in Tj \ Uj; and each of these children is the root of a PGW(l) tree, independently 
of each other and of Uj. It follows that Tj is distributed as PGW(l), as claimed. □ 



Proof of Theorem 1. The two graphs T and T are BG(P,Z) and IPC(P,Z). Part (a) follows from 
Lemma 17. Part (b) is trivial. Part (c) follows from the example given in Figure 2. Part (d) is 
trivial. Part (e) follows from Theorems 34 and 48. □ 



Proof of Theorem 2. For each n, IPC^, {— n, — n + 1, ...}), viewed as rooted at 0, is distributed as 
IPC^, N), viewed as rooted at n. The theorem then follows from the fact that IPC^, {— n, —n + 
1, ...})-> IPC(V, Z) almost surely. □ 
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Proof of Theorem 3. Let q : [l,oo) — > (0,1] be the unique map satisfying the implicit equation 
d6 ^d\ X ^ = — 1 for all A G [0, oo). (It is possible to write down a more detailed - though still implicit 
- formula for q, but this is unilluminating and we omit it.) If W is a random variable satisfying 

P{IU < x} — 6{x) for all a; > 1, then q(W) = UniformfO, 1], from which the first part of the 
theorem follows immediately. The second part of Theorem 3 then follows from the first, together 
with Theorems 30, and 31. □ 

Proof of Theorem 46. Let U , U\,U%, ... be the subtrees of IPC(V, Z) introduced in the proof of 
Theorem 48. IPC~(V, Z) consists exactly of an infinite backbone path through vertices no, n\, n%, . . ., 
with Uj attached at rij for each j > 0. Each Uj is distributed as PGW(mj ). So all that remains 
to prove the theorem is to prove that the sequence (mi k(j) ) is distributed as the sequence (Mj)j> 
defined in the introduction. This follows from Theorem 47 (which states that the backward max- 
imum process weights J7ii fc ... stay constant for one plus a geometric number of values of j, with 
parameter dependent on the current weight mi fc . „ , and the fact that the sequence (mi k )k>o is as 
described in Lemma 44. □ 
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